feat(example): add rl-training example #31

hellomypastor · 2025-12-23T08:52:12Z

Summary

Added a new examples/rl-training example demonstrating RL training (CartPole + DQN) inside OpenSandbox, including dependency installation, training, checkpointing, and summary output. Also updated the examples index and aligned dependencies with the default opensandbox/code-interpreter:latest image. Resolves #29 (Add Reinforcement Learning (RL) Sandbox Example).
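
For readers skimming this thread, the train / checkpoint / summary flow described above can be sketched roughly as follows. This is not the code in examples/rl-training/main.py: to stay runnable with only the standard library, tabular Q-learning on a hypothetical 5-state chain stands in for DQN on CartPole, and every name here (`N_STATES`, `step`, `train`, the JSON checkpoint layout) is an assumption made for illustration.

```python
import json
import random
import tempfile
from pathlib import Path

N_STATES = 5  # hypothetical toy chain; the PR's example uses CartPole instead


def step(state, action):
    """Toy environment: action 1 moves right, action 0 moves left.

    Reaching the rightmost state ends the episode with reward 1.
    """
    next_state = max(0, min(N_STATES - 1, state + (1 if action == 1 else -1)))
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes, checkpoint_path, seed=0, alpha=0.5, gamma=0.9,
          epsilon=0.2, max_steps=50):
    """Run epsilon-greedy Q-learning, write a checkpoint, return a summary."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    returns = []
    for _ in range(episodes):
        state, total = 0, 0.0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore
            else:
                # exploit; ties resolve to action 1 (move right)
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # One-step Q-learning target; no bootstrap on terminal states.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state, total = next_state, total + reward
            if done:
                break
        returns.append(total)
    # Checkpoint: here just the Q-table as JSON (a DQN would save weights).
    Path(checkpoint_path).write_text(
        json.dumps({"q_table": q, "episodes": episodes})
    )
    return {
        "episodes": episodes,
        "mean_return": sum(returns) / len(returns),
        "checkpoint": str(checkpoint_path),
    }


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        summary = train(50, Path(tmp) / "checkpoint.json")
        print(json.dumps(summary, indent=2))
```

In a sandboxed run, the dependency installation step would come before this script, and the summary printed at the end is what the caller would read back from the sandbox.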

Testing

Not run (the example is environment-dependent; it requires a running OpenSandbox server and sandbox image)
Unit tests
Integration tests
e2e / manual verification

Breaking Changes

None
Yes (describe impact and migration path)

Checklist

Linked Issue or clearly described motivation
Added/updated docs (if needed)
Added/updated tests (if needed)
Security impact considered
Backward compatibility considered

jwx0925 · 2025-12-25T01:16:49Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you:

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

examples/rl-training/main.py

hittyt

LGTM

hellomypastor requested review from Pangjiping, hittyt, jwx0925 and ninan-nn as code owners December 23, 2025 08:52

chatgpt-codex-connector bot reviewed Dec 25, 2025

examples/rl-training/main.py (outdated, resolved)

hittyt reviewed Jan 12, 2026

examples/rl-training/main.py (outdated, resolved)

hellomypastor force-pushed the feat/rl-example branch from 55f58d5 to 3bb40ac Compare January 12, 2026 03:44

feat(example): add rl-training example

ec3b28b

hellomypastor force-pushed the feat/rl-example branch from 3bb40ac to ec3b28b Compare January 12, 2026 03:46

hittyt self-requested a review January 12, 2026 03:49

hittyt approved these changes Jan 12, 2026

hittyt merged commit 5867983 into alibaba:main Jan 12, 2026
2 checks passed

3 participants
